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Abstract 

Background: The Leishmaniases are a group of clinically diverse diseases caused by parasites of the genus 
Leishmania. To distinguish between species is crucial for correct diagnosis and prognosis as well as for treatment 
decisions. Recently, sequencing of the HSP70 coding region has been applied in phylogenetic studies and for 
identifying of Leishmania species with excellent results. 

Methods: In the present study, we analyzed the 3'-untranslated region (UTR) of Leishmania HSP70-type I gene from 
24 strains representing eleven Leishmania species in the belief that this non-coding region would have a better 
discriminatory capacity for species typing than coding regions. 

Results: It was observed that there was a remarkable degree of sequence conservation in this region, even 
between species of the subgenus Leishmania and Viannia. In addition, the presence of many microsatellites was a 
common feature of the 3'-UTR of HSP70-I genes in the Leishmania genus. Finally, we constructed dendrograms 
based on global sequence alignments of the analyzed Leishmania species and strains, the results indicated that this 
particular region of HSP70 genes might be useful for species (or species complex) typing, improving for particular 
species the discrimination capacity of phylogenetic trees based on HSP70 coding sequences. Given the large size 
variation of the analyzed region between the Leishmania and Viannia subgenera, direct visualization of the PCR 
amplification product would allow discrimination between subgenera, and a Haelll-PCR-RFLP analysis might be used 
for differentiating some species within each subgenera. 

Conclusions: Sequence and phylogenetic analyses indicated that this region, which is readily amplified using a 
single pair of primers from both Old and New World Leishmania species, might be useful as a molecular marker for 
species discrimination. 
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Background 

Protozoan parasites of Leishmania genus cause a severe 
disease, Leishmaniasis, which threatens 350 million 
people worldwide and some 2 million new cases occur 
yearly. Furthermore, mortality and morbidity from Leish- 
maniasis worldwide show a worrying increasing trend 
[1]. Infection by the parasite Leishmania can result in a 
broad spectrum of pathological outcomes in the human 
host, ranging from simple self-healing cutaneous lesions 
(cutaneous Leishmaniasis, CL) to acute visceral Leish- 
maniasis (VL), commonly referred to as kala-azar. The 
different pathologies usually correlate with infection by 
different species [2]. Thus, Leishmania donovani and 
Leishmania infantum usually cause VL, whereas the rest 
of Leishmania species generally cause cutaneous Leish- 
maniasis (CL). Nevertheless, exceptional cases have been 
described, such as visceral outcomes in individuals 
infected with L. tropica. After self-cure or successful 
chemotherapy, many patients remain asymptomatically 
infected; however, such persistence often gives rise to 
disease reactivation, including post-kala-azar dermal 
Leishmaniasis after cure of VL caused by L. donovani, 
diffuse cutaneous Leishmaniasis (DCL) lesions in L. mex- 
icana -infected people, or the development of destructive 
mucocutaneous Leishmaniasis (espundia) months or years 
after healing of a localized cutaneous ulcer caused by L. 
braziliensis [1]. 

On the other hand, treatment outcome in patients can 
vary with the infecting Leishmania species because of 
different sensitivity of the parasite species both to the 
standard drugs, pentavalent antimonials and miltefosine, 
and those on clinical trial, paromomycin [3]. Thus, cor- 
rect diagnosis of the infecting Leishmania species is cru- 
cial for making decisions regarding prognosis and 
treatments, and also for epidemiological monitoring of 
the parasite spread [4]. 

To date, around 20 Leishmania species are known to 
infect humans, even though the species status is under 
discussion for some of them [1,5]. In fact, a major con- 
ceptual problem in epidemiology of Leishmaniasis is the 
taxonomy of the etiological agent as Leishmania is a 
morphologically uniform genus. At present, multilocus 
enzyme electrophoresis (MLEE), an isoenzyme analysis 
that is based on 13 enzymes, is the reference method for 
Leishmania typing [1]. However, MLEE has important 
limitations: strains with the same enzyme phenotype 
may have distinct amino acid sequences, the degree of 
relationship between different phenotypes is not known, 
and putative heterozygous phenotypes are difficult to in- 
terpret [6,7]. Furthermore, this technique is time con- 
suming, requires mass parasite culturing, and it should 
be done by one of the few reference centres [1]. There- 
fore, its usefulness for taxonomic purposes is under dis- 
pute [8]. In conclusion, new methods for species 



identification are needed to ensure proper identification 
and therapy. 

Molecular biology techniques, such as PCR and DNA 
sequencing, are replacing traditional methods of tax- 
onomy, as they tend to be more specific and easier to 
perform. Thus, sequencing of genes such as DNA 
polymerase a [9], RNA polymerase II [10], gp63 [11], 
cytochrome oxidase II [12,13], N-acetylglucosamine- 
1 -phosphate transferase [14], cystein protease b [15], 
cytochrome b [16,17], SSU rDNA [18], and 7SL RNA 
[19] has been carried out for phylogenetic studies. In 
addition, sequencing of ribosomal DNA internal tran- 
scribed spacers [20,21] and kinetoplast DNA minicir- 
cles [22] is being used for taxonomic purposes within 
the Leishmania genus. Remarkably, in recent studies, 
coding regions of genes for most metabolic enzymes 
used in the MLEE typing method have been deter- 
mined and used for Leishmania typing [6,23,24]. As 
expected, these studies have demonstrated that DNA 
sequencing has greater discriminatory power than 
MLEE typing. Accordingly, this approach, based on 
sequence analysis of selected genes, has been dubbed 
MLST (for "multilocus sequence typing") [6]. 

The HSP70 gene is being extensively used as a target 
for PCR-RFLP assays for Leishmania species discrimin- 
ation [25,26]. In fact, Montalvo and co-workers [27] have 
designed a PCR-RFLP method allowing identification of 
most medically relevant Old and New World Leishmania 
species on the basis of HSP70 PCR amplification. More 
recently, sequence analysis of the 1380-bp fragment of 
the HSP70 coding region, commonly used in the PCR- 
RFLP method, has been directly used for phylogenetic 
and taxonomic purposes [26,28]. The phylogenetic trees 
obtained by da Silva et al [26] and Fraga et al [28] dem- 
onstrate that DNA sequencing of HSP70 coding regions 
represent a valuable technique for Leishmania species 
identification. As a remarkable result from these studies, 
Fraga and coworkers [28] defined eight, highly sup- 
ported, monophyletic groups within the genus Leish- 
mania: (1) L. donovani/L. infantum, (2) L. tropica/L. 
aethiopica, (3) L. major, (4) L. mexicana/L. amazonensis, 
(5) L. naijfi, (6) L. braziliensis/L. peruviana, (7) L. guya- 
nensis/L. panamensis, and (8) L. lainsoni. Nevertheless, 
the HSP70 protein and the coding region of its gene are 
among the best conserved sequences along the evolution- 
ary tree of life [29]. As untranslated regions (UTR) of 
genes are expected to have a lower evolutionary constric- 
tion than coding regions (CDR), it is conceivable that 
UTR sequences may be a better target than CDR for typ- 
ing of closely related organisms [30]. In the present study, 
we sequenced most of the 3'-untranslated region (UTR) 
of Leishmania HSP70-type I (HSP70-I) gene from differ- 
ent species and strains with the belief that this region 
might be useful for species typing within the genus, 
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having hopefully a more discriminating capacity than the 
HSP70 coding region. 

Methods 

DNA extraction, amplification, and cloning 

The Leishmania isolates used in this study are listed in 
Table 1. Promastigotes were cultivated in RPMI medium 
supplemented with 10% heat-inactivated fetal bovine 
serum at 26°C. Total genomic DNA was extracted from 
promastigotes using a standard sodium dodecyl sulfate- 
proteinase K procedure as previously described [31]. For 
amplification, the primers 70-IR-D (5'- CCAAGGTCGA 
GGAGGTCGAC TA- 3') and 70-IR-M (5'- ACGGG 
TAGGG GGAGGAAAGA -3') were used (Figure 1). 
Amplifications were done with the Maxime PCR PreMix 
Kit (i-Taq; Intron Biotechnology), using 20-40 ng of 



genomic DNA, dNTPs (2.5 mM each) and 0.5 [iM of 
each primer. The PCR profile was 95°C for 2 min fol- 
lowed by 30 cycles of 95°C for 30 sec, 62.5°C for 30 sec, 
72°C for 1 min and 20 sec, and then a final elongation 
step at 72°C for 5 min. The PCR products were checked 
by agarose gel electrophoresis and purified using a com- 
mercial DNA purification kit (FavorPrep Gel/PCR Puri- 
fication Kit; Favorgen), following the manufacturers 
instructions. 

For cloning, the purified PCR products were ligated 
into the pCR2.1 vector using the TA Cloning Kit (Invi- 
trogen), and the ligation product used to transform XL1- 
blue competent bacteria [35]. DNA sequencing was car- 
ried out in the facilities of the Servicio de Genomica 
(Parque Tecnologico de Madrid-UAM), using Big Dye 
Terminators v3.1 kit (Applied Biosystem). 



Table 1 Reference strains and sequences used in this study 



Species 


WHO code Origin 


Sequence ID 


Size (bp) 


Label 3 


L. oethiopico 


MHOM/ET/82/101-82 Ethiopia 


HE575325 b 


696 e 


Lae-101 


L oethiopico 


MHOM/ET/82/652-82 Ethiopia 


HE575326 b 


701 


Lae-652 


L oethiopico 


MHOM/ET/72/L100 Ethiopia 


HE575327 b 


700 


Lae-L100 


L omozonensis 


MHOM/BR/77/LTB001 6/C1S1 Brazil 


L14605 c 


703 e 


Lam-LBT 


L omozonensis 


IFLA/BR/67/PH8 Brazil 


HE575328 b 


700 


Lam-PH8 


L. braziliensis 


MHOM/BR/75/M2904 Brazil 


Contig LbrM28_V2_October 
(1081948-1082464, minus strand) d 


517 e 


Lb-M2904 


L braziliensis 


MHOM/BO/00/CUM45 Bolivia 


HE575329 b 


518 


Lb-CUM45 


L donovoni 


MHOM/IN/80/DD8 India 


HE575330 b 


718 


Ld-DD8 


L donovoni 


MHOM/IN/00/DEVI India 


HE575331 b 


724 


Ld-DEVI 


L donovoni 


MHOM/ET/67/HU3 Ethiopia 


HE575332 b 


711 e 


Ld-HU3 


L donovoni 


IMRT/KE/62/LRC-L57 Kenya 


HE575333 b 


715 


Ld-LCR 


L guyonensis 


MHOM/PE/91/LC1446 Peru 


HE575334 b 


585 e 


Lg-LC1446 


L. infantum 


MCAN/ES/98/LLM-877 (JPCM5) Spain 


Contig LinJ28_20070420_V3 (1094043-1094771, 
minus strand) d 


729 e 


Li-JPCM5 


L infantum 


MHOM/FR/78/LEM-75 France 


HE575335 b 


731 


Li-LEM75 


L infantum 


MCAN/ES/01/LLM-1050 Spain 


HE575336 b 


733 


U-LLM1050 


L. infantum 


MHOM/ES/05/LLM-1 525 Spain 


HE575337 b 


731 


U-LLM1525 


L lain son i 


MHOM/BO/95/CUM71 Bolivia 


HE575338 b 


589 e 


LI-CUM71 


L. major 


MHOM/IL/80/Friedlin Israel 


LmjF28_01_20050601_V5.2 (1059883-1060597, 
minus strand) d 


715 e 


Lmj-Friedlin 


L major 


MRHO/SU/59/P-STRAIN Former Soviet Union 


HE575339 b 


715 


Lmj-P-STRAIN 


L major 


MHOM/SU/73/5-ASKH Former Soviet Union 


HE575340 b 


718 


Lmj-5-ASKH 


L ponamensis 


MHOM/COL/81/L13 Colombia 


LpnLscaffold351 (149539-150125, minus strand/ 


587 


Lpa-L13 


L. peruviana 


MHOM/PE/90/LH249 Peru 


HE575341 b 


516 e 


Lp-LH249 


L tropica 


MRAT/IQ/72/ADHANIS1 Iraq 


HE575342 b 


704 e 


Lt-ADHANIS1 


L tropica 


MHOM/SU/74/K27 Former Soviet Union 


HE575343 b 


702 


Lt-K27 



a Label for strain identification in Figure 3. 

b New sequences submitted to EMBL-EBI database. 

c GenBank accession number. 

d GeneDB identificator. 

e Sequences depicted in Figure 2. 

f TriTrypDB identificator (www.tritrypdb.org). 
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| CDR (HSP70-I) | — 3 '- UTR , 5'-UTRj CDR (HSP70-I or -II) ~| 

« IR 
70-/A-M 

Figure 1 Schematic map of the Leishmania HSP70 locus (fragment), showing the region amplified in this study. The map was drawn to 
scale [39]. PCR primer annealing sites are indicated by arrows. CDR, protein coding region; 5'- and 3'-UTR, untranslated regions; IR, intergenic 
region. 



Sequence analysis 

The chromatograms were visualized using the BioEdit 
Sequence Alignment Editor [36]. For all the clones, se- 
quencing of both strands was carried out. 

For all analyses, the priming sites were trimmed from 
both ends of all sequences. A search for microsatellite 
sequences was performed using the "Microsatellite 
repeats finder" program [37]. Parameter settings were: 
length of repeated sequence (minimum, 2; maximum, 6); 
minimum number of repeats, 3; minimum length of tan- 
dem repeat, 6; and allowed percentage of mismatches, 
10%. 

All sequences were multiple aligned with the default 
option using ClustalW2.1 at The European Bioinformat- 
ics Institute [33]. For phylogenetic studies, the multiple 
sequence alignment was analyzed using the software 
MEGA4 [32]. The evolutionary history was inferred 
using the Neighbor-Joining and Minimum Evolution 
methods [34], and the bootstrap consensus tree inferred 
from 1000 replicates. The evolutionary distances were 
computed using the Maximum Composite Likelihood 
method [38] and the units represent the number of base 
substitutions per site. All positions containing gaps and 
missing data were eliminated from the dataset. 

Sequence data 

Sequences generated in this study have been deposited 
in the EMBL-EBI database under accession numbers 
HE575325-HE575343 (Table 1). 

Results and discussion 

PCR amplification of non-coding sequences of Leishmania 
HSP70-I genes 

The HSP70 locus in Leishmania consists of several 
HSP70 gene copies arranged in a head-to-tail manner 
[39,40]. All the genes are highly conserved in sequence 
with the exception of the gene located at the 3' end of 
the cluster, which presents a 3'-UTR different to the rest 
of genes. Hence, the latter gene is referred to as HSP70- 
II gene and the other genes as HSP70-I genes. Given the 
high sequence conservation of HSP70 CDR among the 
different Leishmania species [41], we designed oligonu- 
cleotides at the 5' and 3' extremities of the CDR with the 
purpose of PCR amplify the entire intercoding region of 



HSP70-I genes. The intercoding region (3'-UTR + inter- 
genic region (IR) + 5'-UTR) in the L. infantum HSP70 
locus is 1600 bp in length [39]. However, the task turned 
out to be complicated, as it was only possible to amplify 
and clone the expected region for a few Leishmania spe- 
cies. In order to overcome this difficulty, we searched for 
conserved sequence blocks within the 3'-UTR of HSP70- 
I genes from different Leishmania species, whose 
sequences were available in the databases. Finally, we 
succeeded and designed an oligonucleotide pair able to 
efficiently amplify a 3'-UTR sequence from all the Leish- 
mania DNA samples, irrespective of the subgenus 
{Leishmania or Viannia) to which they belonged. See 
Figure 1 for details on the location of amplification pri- 
mers 70-IR-D and 70-IR-M in the HSP70 locus. 

The UTR-I fragment for 20 strains of different Leish- 
mania species were PCR- amplified and cloned. However, 
it was not possible to obtain a complete sequence for the 
strain MHOM/GR/80/GR-L35 of L. tropica, even after 
using alternative sequencing protocols in which DMSO 
or betaine was added [42]. As a result, this strain was 
not included in subsequent studies. For many of the 
other sequences, the sequence chromatogram was not 
uniform, showing a drastic decrease in intensity after 
passing GC-rich regions. This fact may be an indication 
that the UTR-I fragment contains regions with strong 
secondary structures that were hindering (or totally 
blocking in one case, as mentioned above) the polymer- 
ase advance during sequencing reaction. 

In addition to the 19 sequences obtained in this work, 
we retrieved 5 sequences from databases, amounting to a 
total of 24 sequences (Table 1). It should be noted that 
sequence information of this region is not yet available 
in L. mexicana [43], L. donovani [43], and L. tarentolae 
[44] genome sequencing projects, which are ongoing. 
The HSP70-UTR-I amplified region varied in size from 
516 (in L. peruviana) to 733 bp (in L. infantum). Thus, 
the region is clearly shorter in Leishmania species of the 
subgenus Viannia than in species of the subgenus Leish- 
mania, and this fact might be exploited as a PCR-based 
method for quick discrimination between both Leish- 
mania subgenera. Within a given species, the length of 
this region shows a very low inter-strain variation; for ex- 
ample, the size of the region in four different L. donovani 
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strains varies from 711 to 718, in L. infantum (four 
strains) ranges from 729 to 733, in L. major (3 strains) 
ranges from 715 to 718, and in L. aethiopica (3 strains) 
ranges from 696 to 701. Within a given Leishmania spe- 
cies, size variants are a consequence of variations in the 
repeat number of microsatellite sequences (see Figure 2). 

The HSP70 UTR-I contains a remarkable abundance of 
microsatellites 

A search for structural motifs along the sequenced UTR-I 
fragment in the different Leishmania species was carried 
out. The study showed a significant richness in microsatel- 
lites sequences (Figure 2). Microsatellites are tandemly 
repeated stretches of short nucleotide motives of 1-8 bp 
ubiquitously distributed in eukaryotic genomes [45]. The 
Leishmania genome is not an exception, and multilocus 
microsatellite typing (MLMT) is being used as a powerful 
tool for population genetic and epidemiological studies of 
Leishmania [46]. In the L. infantum, L. major and L. brazi- 
liensis genomes, the CA repeats represent by far the most 
common microsatellite [47]. Indeed, CA-microsatellites 
(read as TG or GT on the complementary strand) were 
the more abundant class in the UTR-I of all Leishmania 
species, showing most of them have an antisense orienta- 
tion (Figure 2). TGC-microsatellites, another abundant re- 
peat in Leishmania genomes [47], were found in the 
Leishmania species of the subgenus Leishmania but absent 
in the species of the subgenus Viannia. However, the ana- 
lyzed HSP70 UTR-I fragment contains many other micro- 
satellites (C, GC, GTG, GGC, GCG) that may be 
categorized as uncommon for Leishmania. 

The HSP70 locus in T. cruzi, T. brucei and Leishmania 
spp. has a similar genomic organization, i.e. it contains 
several genes that are head-to-tail tandemly organized. 
However, the UTRs of HSP70 genes are extremely diver- 
gent between Trypanosoma and Leishmania genera. 
Thus, the 3'UTR of HSP70 genes in T brucei and T 
cruzi are only 186 and 213 nucleotides in length, respect- 
ively [48,49]. In contrast, the 3'-UTR of Leishmania 
HSP70 genes are higher than 1000 nucleotides in length 
for species of the Leishmania subgenus [39], and 936 
nucleotides for the 3'-UTR-I in Leishmania braziliensis 
[30]. These findings suggest that UTRs of HSP70 genes 
have evolved towards opposite ends regarding sequence 
complexity in Trypanosoma and Leishmania. On the 
other hand, from the analysis of Figure 2, it became clear 
that the analyzed region of the HSP70 3'UTR has experi- 
enced also a notable evolution after separation of sub- 
genus Leishmania and Viannia. In the species of the 
subgenus Leishmania, this region has grown both in size 
and microsatellite complexity. Interestingly, the regula- 
tion of HSP70-I expression seems to be more complex in 
species of the subgenus Leishmania than in L. brazilien- 
sis [40]. Thus, it would be interesting to analyze the 



contribution of the different microsatellites to the spe- 
cific regulation of HSP70-I genes in Leishmania. 

Phylogenetic relationships of Leishmania species based on 
sequences of the HSP70-I UTR 

From the structural analysis of this region in the different 
Leishmania species, it became evident that this fragment 
of HSP70-I genes has greatly diverged between the sub- 
genus Leishmania and Viannia (Figure 2). In order to 
study the usefulness of these sequences to discriminate at 
the level of species within each subgenus, phylogenetic 
analyses were carried out (Figure 3). The inferred phylo- 
genetic tree based on either the Neighbor-Joining or 
Minimum Evolution methods indicated that the sequenced 
region may be useful for clustering Leishmania strains at 
the level of species or complex of species (99 to 100% boot- 
strap support). Not surprisingly, the subgenera Leishmania 
and Viannia constitute two distinct monophyletic lines. In 
addition, our phylogenetic analyses gave a robust support 
for the following Leishmania species or clusters: L. amazo- 
nensis (which forms a separated branch from the Old 
World Leishmania species), L. infantum/L. donovani (L. 
donovani complex), L. major, L. tropica and L. aethiopica, 
within the Leishmania subgenus; and L. braziliensis/L. per- 
uviana (L. braziliensis complex), L guyanensis/L. panamen- 
sis (L. guyanensis complex), and L. lainsoni, within the 
Viannia subgenus. With few differences, our phylogenetic 
tree was coincident with that obtained by Fraga et al. [28] 
using sequences of the Leishmania HSP70 CDR. Never- 
theless, our phylogenetic results support a separation 
between L. tropica and L. aethiopica (99% of boot- 
strap value; Figure 3), whereas they cannot be distin- 
guished based on HSP70 coding sequences. 

Separation between L. donovani and L. infantum strains 
remains as an issue to be resolved, even though several 
comprehensive studies have been conducted [20,46,50- 
53]. In our phylogenetic tree (Figure 3), separation of L. 
donovani and L. infantum strains was observed, even 
though the bootstrap value (50-52%, depending on the 
phylogenetic method) was not statistically significant. In 
the future, we plan to analyze a large number of L. dono- 
vani and L. infantum strains in order to define signatures 
within the 3'-UTRI that could allow discriminating specif- 
ically among both species (or subspecies). 

Although sequencing is becoming a cost effective 
methodology, it is still not feasible for routine diagnosis, 
being more attractive other methodologies such as re- 
striction fragment length polymorphisms of PCR ampli- 
cons (PCR-RFLP). This methodology has been found to be 
valuable for Leishmania species discrimination, using 
amplicons from the HSP70 coding region [26,27]. We ana- 
lyzed the sequence of the UTR-I amplicons described in 
this study searching for restriction enzyme cleavage sites 
that could be used for PCR-RFLP-based identification of 
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(CT)e (GC\ (C), (CA) 4 (TG) 3 (CA) 5 

L.infantum \ U U II Ull I I I I U 729 nt 

(GTG) 5 (C)„ (GCG) 5 (TO),, (TGC) 3 (GT), S 



(CT) 6 (GO (CA) 4 (TG) 3 (CA) 5 

L. donovani \ II I I II I I n I I I I l"l I 712 nt 



(GTG) 5 (C) 24 (GCG) 5 (TG)„ (TGC) 3 (GT) 1C 



(CT) a (TG) 3 (GC^ (CA) 3 (TG) 3 

L major I ■ IIIILJ Ul U I I U I 715 nt 



(GC) 3 f (GGC) 4 (C) 16 (TG), 5 (TGC) 3 (GT) 9 

(GTG) S 

(CT) 3 

(CT^ (GO (GC^ (CA) 7 { (TG) 3 

L tropica I ■Mil II I I I n l~l I I ■ 1 1 II m I 704 nt 

(GCff (C)„ (GCG) 5 (TG), 0 (TGC) 3 (GT), 0 

(GTG) 5 

(CJ^ (GC^ (GC^ (CA) 7 (TG) 3 

Laethiopica F- ■ II U Ul U II ■ U II ■ I I I 696 nt 



U II U 

(GTG) 4 



(GGCUC),, (TG) 10 (TGC) 3 (GT) 8 



(C) 9 (GT) 3 (GC) 5 (CA)^ 

L.amazonsnsis \ II I . I I I II Mil II I I I II II II I 703 nt 

(GC)^ (GCG) 6 (TG) 4 (GCG) 5 (TG) 9 (TGC) 3 (GT), 4 

(GTG) 5 

(CA), (GT) 4 

L. braziliensis ID II U U □ U 51 7 nt 

(TG) 3 (GC), (GCG) 3 (C)„ (TG) 8 (GT) 10 



m (GT) 5 

L. peruviana III M I I I I II 516 nt 

(TG) 3 (GC), (GCG) 4 (TG) 5 (GT) S 



(GT^ 

L. guyanensis I u ll l_l l_l I I I I l_l I 585 nt 

(TG) 3 (Gq (GCG) 4 (TG) 5 (GT), (TG) 5 



(GT) 5 (GT) 5 

Llainsoni ill I I I mi I ill 589 nt 

(TG) 3 (GCG) ( (GT) 3 (TG) 5 

Figure 2 Microsatellite distribution in the 3'-UTR of HSP70-I genes in several Leishmania species. The repeated motifs correspond to those 
found in the sense strand. Note that the drawing scale is not proportional for the different Leishmania species; however, for a given seguence, the 
relative positions of the microsatellites are scaled. 
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U-JPCM5 

- Li-LEM75 
U-LLM1525 
U-LLM1050 

- Ld-LCR 
Ld-DD8 
Ld-DEVI 

Ld-HU3 



Q jr- Lmj-STRAIN 
10o P Lmj-5-ASKH 



L Lmj-Friedlin 

Lt-ADHANIS1 
Lt-K27 
Lae-L100 
- Lae-652 
LLae-101 

Lam-LBT 
Lam-pH8 
LI-CUM71 



100 



8 1 Lg-LC1446 

~L Lpa-L13 
Lp-LH249 
Lb-CUM45 
Lb-M2904 



B 



95 



100 



U-LLM1525 

- U-LEM75 
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Figure 3 Phylogeny of Leishmania according to the Neighbor-joining (panel A) and Minimum Evolution (panel B) methods based on 
the multiple sequence alignment of 3'UTR of HSP70-I gene. Numbers above the branch represent percent recovery in 1,000 bootstrap 
pseudoreplicates analysis (only values above 90 are shown). All positions containing gaps and missing data were eliminated from the dataset 
(complete deletion option). There were a total of 493 positions in the final dataset. Phylogenetic analyses were conducted in MEGA4 [32]. 
Distance represents the number of base substitutions per site. Species are abbreviated as follows: Lae, L aethiopica; Lam, L amazonensis; Lb, L 
braziliensis; Ld, L donovani; Lg, L. guyanensis; Li, L infantum; LI, L. lainsoni; Lmj, L. major; Lp, L. peruviana; Lpa, L. panamensis; Lt, L tropica. See 
Table 1 for additional information of the strains. 



Leishmania species. On the basis of digestion with Haelll 
restriction enzyme, our in silico analyses predicted seven 
different patterns (Table 2) that would allow the separation 
of the following species of groups of species: L. aethiopica 
/L. tropica, L. amazonensis, L. braziliensis/L. peruviana, L. 
donovani/L. infantum, L. guyanensis/L. panamensis, L. 
lainsoni and L. major. 

Conclusions 

Identification of the exact parasite species isolated from 
human and animal hosts, as well as from sand fly vectors, 
gains increasing importance for diagnostics, epidemio- 
logical surveillance, and clinical studies. As sequencing is 
becoming a cost-effective methodology, it is possible to 
envision the use of a battery of genes as genomic markers 
for phylogenetic analyses of Leishmania species towards 
the goal of both a molecular taxonomy and a definitive 
diagnostic method. Our data show the potential of the 
UTR of HSP70-I genes as a molecular marker for 
Leishmania species typing. In addition, this study has 
shown the existence of complex arrays of microsatellites in 
the UTR of HSP70-I, whose involvement in regulatory 



Table 2 In silico prediction of Haelll restriction fragments 
of the HSP70-I UTR amplification product in the 
Leishmania species analyzed in this study 



Leishmania spp 


PCR product 
size (pb) 


Haelll restriction 
fragments 


Fragment 
sizes (pb) 


L aethiopica 


738 


3 


317, 243, 178 


L amazonensis 


745 


3 


414, 221, 110 


L braziliensis 


559 


5 


229, 103, 99,81,47 


L donovani 


754 


4 


435,167,125, 27 


L guyanensis 


627 


3 


442, 104, 81 


L. infantum 


771 


4 


436, 182, 126, 27 


L lainsoni 


631 


4 


399, 104,81,47 


L major 


757 


4 


323, 186, 178, 70 


L panamensis 


629 


3 


444, 104, 81 


L peruviana 


558 


5 


227, 104, 99,81,47 


L tropica 


746 


3 


321, 243, 182 
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mechanisms of gene expression will be analyzed in future 
studies. 

Abbreviations 

MLEE: Multilocus enzyme electrophoresis; UTR: Untranslated region; 
CDR: Coding region. 
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